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AMENDMENTS TO THE CLAIMS: 

This listing of claims will replace all prior versions and listings of clainns in the 
application: 

1 . (Currently Amended) A speech processing apparatus comprising: 

generation means for generating a pseudo acoustic echo signal for each 
sample , said samples being based on a current impulse response simulating an acoustic 
echo transfer path and on a source signal; 

supply means for holding the current impulse response for each sample and 
supplying the current impulse response to said generation means; 

elimination means for subtracting said pseudo acoustic echo signal from a 
near-end speech signal to remove an acoustic echo component and thereby generate an 
acoustic signal which has been echo-canceled for each sample; 

update means for continually updating the impulse response for each sample 
by using said source signal, said acoustic echo-canceled signal and the current impulse 
response held by said supply means and for supplying the updated impulse response to 
said supply means; 

decision means for checking[[,]] in each frame , said frames beina comprised 
of pluralitv of samples , whether or not a voice is included in the near-end speech signal, by 
using time domain information and frequency domain information of said acoustic signal 
after said acoustic signal has been echo-canceled, said decision means outputting a result 
indicating whether said voice is included in the near-end speech signal , whoro i n said n e ar - 
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o nd spe e ch s i gna l i ncludGS compononts of tho aoouctic ocho from a known courco, tho 
voic e and an add i tiv e background no i s e from an unknown sourc e; 

storage means for storing one or more impulse responses in each frame; 

[[and]] 

control means for, in a frame for which the result of decision made by said 
decision means is negative, storing in said storage means the current impulse response 
held by said supply means and, in a frame for which the result of the decision is positive, 
retrieving one of the impulse responses stored in said storage means and supplying the 
one of the impulse responses to said supply meansfF.II : and 

means for determining a spectrum for each frame bv performing the Fourier 
transform on said acoustic echo-canceled signal; 

means for successivelv determining a spectrum mean for each frame based 
on the spectrum obtained; and 

means for successivelv subtracting the spectrum mean from the spectrum 
calculated for each frame from said acoustic echo-canceled signal to remove additive noise 
of an unknown source. 

2. (Original) A speech processing apparatus as claimed in claim 1, wherein said 
acoustic echo-canceled signal is used for speech recognition. 

3. (Cancelled). 
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4. (Currently amended) A speech processing apparatus as claimed in claim [[2]] 
1, further comprising: 

moans for dotorm i ning a spectrum for each frame by performing tho Fourior 
transform on sa i d acoust i c echo cancolod signal; 

moans for succoss i vo l y dotorm i n i ng a opoctrnm moan for e ach frame basod 
on th e spectrum obtain e d; 

moans for succ e ss i v el y subtract i ng th e sp e ctrum m e an from th e sp e ctrum 
ca l cu l at e d for each fram e from sa i d acoust i c e cho canc e l e d s i gnal; 

means for determining a cepstrum from the spectrum, the spectrum be i ng 
r e mov e d of having the additive noise of an unknown source removed by said subtraction 
means; 

means for determining for each talker a cepstrum mean of a speech frame 
and a cepstrum mean of a non-speech frame, separately, from the cepstrums obtained; 
and 

means for subtracting the cepstrum mean of the speech frame of each talker 
from the cepstrum of the speech frame of the talker and for subtracting the cepstrum mean 
of the non-speech frame of each talker from the cepstrum of the non-speech frame of the 
talker to correct in a lump multiplicative distortions that are dependent on microphone 
characteristics and spatial transfer characteristics from the mouth of the talker to the 
microphone, wherein said means for subtracting comprises first subtracting means for 
subtracting the cepstrum mean of the speech frame of each talker from the cepstrum of the 
speech frame of each talker and second means for subtracting the cepstrum mean of the 
non-speech frame of the talker and by said first subtracting means and said second 
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subtracting means, said subtracting means corrects in a lump multiplicative distortions that 
are dependent on a microphone characteristics and spatial transfer characteristics from the 
mouth of the talker to the microphone. 

5. (Currently amended) A speech processing apparatus as claimed in claim [[2]] 
1, further comprising: 

moans for dotorm i ning a spectrum for oach frame by performing tho Fouri e r 
transform on said acoust i c e cho canc ele d signal; 

means for determining a cepstrum from the spectrum obtained; means for 
determining for each talker a cepstrum mean of a speech frame and a cepstrum mean of a 
non-speech frame, separately, from the cepstrums obtained; and 

means for subtracting the cepstrum mean of the speech frame of each talker 
from the cepstrum of the speech frame of the talker and for subtracting the cepstrum mean 
of the non-speech frame of each talker from the cepstrum of the non-speech frame of the 
talker to correct multiplicative distortions that are dependent on microphone characteristics 
and spatial transfer characteristics from the mouth of the talker to the microphone. 

6. (Cancelled). 
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7. (Currently Amended) A speech processing method comprising: 

a generation step for generating a pseudo acoustic echo signal for each 
sample , said samples being based on a current impulse response simulating an acoustic 
echo transfer path and on a source signal; 

a supply step for holding the current impulse response for each sample and 
supplying the current impulse response to said generation step; 

an elimination step for subtracting said pseudo acoustic echo signal from a 
near-end speech signal to remove an acoustic echo component and thereby generate an 
acoustic signal which has been echo-canceled for each sample; 

an update step for continually updating the impulse response for each sample 
by using said source signal, said acoustic echo-canceled signal and the current impulse 
response held by the supply step and for supplying the updated impulse response to said 
supply step; 

a decision step for checking[[,]] in each frame , said frames being comprised 
of plurality of samples , whether or not a voice is included in the near-end speech signal, by 
using time domain information and frequency domain information of said acoustic signal 
after said acoustic signal has been echo-canceled, said decision step outputting a result 
indicating whether said voice is included in the near-end speech signal , whoro i n sa i d n e ar - 
e nd sp ee ch signa l i nc l ud e s compon e nts of th e acoust i c e cho from a known sourc e , th e 
vo i co and an addit i ve background no i s e from an unknown sourc e; 

a storage step for storing one or more impulse responses in each frame; 

[[and]] 
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a control step for, in a frame for which the result of decision made by said 
decision step is negative, storing in said storage step the current impulse response held by 
the supply step and, in a frame for which the result of decision is positive, retrieving one of 
the impulse responses stored in said storage step and supplying it to said supply step[[.]]; 

a step for determining a spectrum for each frame by performing the Fourier 
transform on said acoustic echo-canceled signal: 

a step for successively determining a spectrum mean for each frame based 
on the spectrum obtained: and 

a step for successively subtracting the spectrum mean from the spectrum 
calculated for each frame from said acoustic echo-canceled signal to remove additive noise 
of an unknown source. 

8. (Original) A speech processing method as claimed in claim 7, wherein said 
acoustic echo-canceled signal is used for speech recognition. 

9. (Cancelled). 
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10. (Currently amended) A speech processing method as claimed in claim [[8]] 7, 
further comprising: 

a stop for dotorm i n i ng a spectrum for oach frame by perform i ng tho Four i er 
transform on sa i d acoust i c e cho - canc ele d s i gna l ; 

a stop for successively dotorm i ning a sp e ctrum m e an for each fram e bas e d 
on tho spectrum obta i n e d; 

a step for succ e ssiv el y subtract i ng th e sp e ctmm m e an from tho sp e ctrum 
calcu l ated for oach framo from sa i d acoustic ocho canc el ed signa l to romovo additive no i se 
of an unknown sourc e ; 

a step for determining a cepstrum from the spectrum removed of the additive 

noise; 

a step for determining for each talker a cepstrum mean of a speech frame 
and a cepstrum mean of a non-speech frame, separately, from the cepstrums obtained; 
and 

a step for subtracting the cepstrum mean of the speech frame of each talker 
from the cepstrum of the speech frame of the talker and for subtracting the cepstrum mean 
of the non-speech frame of each talker from the cepstrum of the non-speech frame of the 
talker to correct multiplicative distortions that are dependent on microphone characteristics 
and spatial transfer characteristics from the mouth of the talker to the microphone. 
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1 1 . (Currently amended) A speech processing method as claimed in claim [[8]] 7, 
further comprising: 

a ctop for dotormin i ng a spectrum for each framo by p e rform i ng the Four ie r 
transform on sa i d acoustic e cho - canc ele d s i gnal; 

a step for determining a cepstrum from the spectrum obtained; a step for 
determining for each talker a cepstrum mean of a speech frame and a cepstrum mean of a 
non-speech frame, separately, from the cepstrums obtained; and 

a step for subtracting the cepstrum mean of the speech frame of each talker 
from the cepstrum of the speech frame of the talker and for subtracting the cepstrum mean 
of the non-speech frame of each talker from the cepstrum of the non-speech frame of the 
talker to correct multiplicative distortions that are dependent on microphone characteristics 
and spatial transfer characteristics from the mouth of the talker to the microphone. 

12. (Cancelled). 

13. (New) A speech processing method comprising the steps of: 
applying a normalized least mean square error algorithm, controlled by a 

near-end talk detection algorithm based on a frame by frame basis voice activity 
detection algorithm, to an input signal to create an acoustic echo-cancelled signal; 
and 
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applying a continuous spectral substitution algorithm to each frame of said 
acoustic echo-cancelled signal to remove an unknown noise source from said 
acoustic echo-cancelled signal. 

14, (New) A speech processing system comprising: 

means for applying a normalized least mean square error algorithm, 
controlled by a near-end talk detection algorithm based on a frame by frame basis 
voice activity detection algorithm, to an input signal to create an acoustic echo- 
cancelled signal; and 

means for applying a continuous spectral substitution algorithm to each frame 
of said acoustic echo-cancelled signal to remove an unknown noise source from 
said acoustic echo-cancelled signal. 
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